which had the following nucleotide sequence: 
hybridization orobe 

5'-TGGACACCTTCAGTATTGATGCCAAGACAGGCCAGGTCATTCTGCGTCGA-3' (SEQ ID NO:428). 

hi order to screen several libraries for a source of a fiall-length clone, DNA fix)m flie libraries was 
screened by PGR amplification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0731 gene using the probe oligonucleotide and one of the PGR primers. RNA 
for construction of the cDNA libraries was isolated from human bone marrow tissue (LIB255). The cDNA 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI 
site, linked with blunt to Sail hemikmased adq)tors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRE:5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al.. Science . 253 : 1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the fiill-lenglh DNA sequence for 
PR0731 [herein designated as UNQ395 (DNA4833 1-1329)] (SEQID NO:424) and the derived protein sequence 
forPR0731. 

The entire nucleotide sequence of UNQ395 (DNA4833 1-1329) is shown in Figure 170 (SEQ ID 
NO:424). Qone UNQ395 (DNA4833 1-1329) contains a single open reading frame with an apparent translational 
mitiation site at nucleotide positions 329-331 and ending at die stop codon at nucleotide positions 3881-3883 
(Figure 170). The predicted polypeptide precursor is 1184 amino acids long (Figure 171). The frill-length 
PR0731 protein shown in Figure 171 has an estimated molecular weight of about 129,022 daltoias and a pi of 
about 5.2. Clone UNQ395 (DNA48331-1329) was deposited with the ATCC on March 31, 1998. Regardmg 
the sequence, it is understood that the deposited clone contains the correct sequence, and the sequences provided 
herein are based on known sequencing techniques. 

Analysis of die amino acid sequence of the full-length PR0731 polypeptide suggests that portions of 
it possess significant identity and similarity to members of the protocadherin family, thereby iiadicating that 
PR0731 may be a novel protocadherin. 

Still analyzing the ammo acid sequence of SEQ ID NO:425, the putative signal peptide is at about ammo 
acids l-13ofSEQIDNO:425. The transmembrane domain is at ammo acids 719-739 of SEQ ID NO:425. The 
N-glycosylation of SEQ ID NO:425 are as fohows: 415-418, 582-586, 659-662, 662-665, and 857-860. The 
cadherin extracellular repeated domain signatures are at about amino acids (of SEQ ID NO:425): 123-133, 232- 
242, 340-350, 448-458, and 553-563. The corresponding nucleotides can be routinely determined given the 
sequences provided herein. 

EXAMPLE 69 : Isolation of cDNA Clones Encoding Human PR0218 

A consensus sequence was obtained relative to a variety of EST sequences as described ui Example 1 
above, wherein the consensus sequence obtamed is herein designated DNA1741 1 . Two proprietary Genentech 
EST sequences were employed in the consensus assembly and are shown in Figure 174 and 175. Based on the 
DNA1741 1 consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that 
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contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-lengfli coding sequence 
for PR0218. 

A pair of PGR primers (forward and reverse) were synthesized: 
forward PGR primer 5'-AAGTGGAGGCGGAGGCTTGG-3' (SEQ ED NO:433); 
reverse PGR primer S'-TCGTTGTTTATGr AGTArtTrrtrt-^' (SEQ ID NO:434). 
5 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA1741 1 
sequence which had the following nucleotide sequence: 
hybridization probe 

5'-ATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAGG-3' (SEQ ID NO:435). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
10 screened by PGR anqjlification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0218 gene using the probe oligonucleotide and one of the PGR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB28). 

DNA sequencing' of the clones isolated as described above gave the full-length DNA sequence for 
PR0218 [herein designated as UNQi92 CDNA30867-1335)] (SEQ ID NO:429) and the derived protein sequence 
15 forPR0218. 

The entire nucleotide sequence of UNQ192 (DNA30867-1335) is shown in Figure 172 (SEQ ED 
g NO:429) . Clone UNQ192 (DNA30867-1335) contains a single open reading frame with an apparent translational 

initiation site at nucleotide positions 150-152 and ending at the stop codon at nucleotide position 1515-1517 
(Figure 172). The predicted polypeptide precursor is 455 amino acids long (Figure 173). The full-lenglh 
20 PR0218 protein shown m Figure 173 has an estimated molecular weight of about 52,917 daltons and a pi of 
about 9.5. Clone UNQ192 (DNA30867-1335) has been deposited with the ATCC on April 28, 1998. Regardmg 
the sequence, it is understood that the deposited clone contains the correct sequence, and the sequences provided 
herein are based on known sequencing techniques. 

Analysis of the ammo acid sequence of the full-length PR0218 polypeptide suggests that PR0218 may 
25 be a novel transmembrane protein. 

Stai analyzing the amino acid sequence of SEQ ID NO:430, the putative signal peptide is at about amino 
acids 1 through 23 of SEQ ID NO:430. Transmembrane domains are potentially at about amino acids 37-55, 
81-102, 150-168, 288-311, 338-356, 375-398, and 425-444 of SEQ ID NO:430. N-glycosylation sites are at 
about amino acids 67, 180, and 243 of SEQ ID NO:430. Eukaryotic cobalamin-binding protein is at about 
30 amino acids 151-160 of SEQ ID NO:430. The corresponding nucleotides can be routinely determined given the 
sequences provided herein. 

EXAMPLE 70 : Isolation of cDNA Clones Encoding Human PR0768 

A consensus sequence was obtamed relative to a variety of EST sequences as described in Example I 
35 above, whereia the consensus sequence obtained is herein designated DNA43448. Based on (be DNA43448 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained 
the sequence of mterest, and 2) for use as probes to isolate a clone of the ftill-length coding sequence for 
PR0768. 
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A pair of PGR primers (forward and reverse) were syntiiesized: 
forward PGR primer 5'-GGCTGAGAGGGGAGTGCTCTTGAG-3' (SEQ ID NO:438); 
reverse PGR nrimftr 5'-GCTGGTGGGGAGTGCAATGTAGCTG-3' (SEQ ID NO:439). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA43448 consensus 
sequence whidi liad the following nucleotide sequence: 
hybridization probe 

5'-CATCGTCCATGTCTCCGATGAGGTCTCTATTGGTCCACGAAGCATC-3' (SEQ ID NO:440). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PGR amplification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0768 gene using the probe oligonucleotide and one of the PGR primers. RNA 
for construction of the cDNA libraries was isolated from human bone marrow tissue (LIB255). 

DNA sequencing of the clones isolated as described above gave flie fuU-length DNA sequence for 
PR0768 [herein designated as UNQ406 (DNA55737-1345)] (SEQ ID NO:436) aad the derived protein sequence 
for PR0768. 

Ihe entire nucleotide sequence of UNQ406 (DNA55737-1345) is shown in Figure 176 (SEQ ID 
NO:436). Clone UNQ406(DNA55737-1345) contains a single open reading frame with an apparent translational 
initiation site at nucleotide positions 20-22 and ending at the stop codon at nucleotide positions 3443-3445 
(Figure 176). The predicted polypeptide precursor is 1141 amino acids long figure 177). The full-length 
PR0768 protein shown in Figure 177 has an estimated molecular weight of about 124,671 daltons and a pi of 
about 5.82. Glone UNQ406 (DNA55737-1345) has been deposited with the ATCC on April 6, 1998. Regarding 
the sequence, it is understood that the deposited clone contains the correct sequence, and the sequences provided 
herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of She full-length PR0768 polypeptide suggests that portions of 
it possess significant sequence identity and similarity with integrin 7. 

StiU analyzing the amino acid sequence of SEQ ID NO:437, the putative signal peptide is at about amino 
acids 1-33 of SEQ ID NO:437. The transmembrane domain is at amino acids 1039-1064 of SEQ ID NO:437. 
N-glycosylation sites are at amino acids: 86-89, 746-749, 949-952, 985-988 and 1005-1008 of SEQ ID NO:437. 
Integrin alpha chain protein domains are identified at about amino acids: 1064-1071, 384-409, 1041-1071, 317- 
346, 443-465, 385^7, 215-224, 634-647, 85-99, 322-346, 470^79, 442-466, 379-408 and 1031-1047 of SEQ 
ID NO:437. The corresponding nucleotides can be routinely determined given the sequences provided herem. 



EXAMPLE 71 : Isolation of cDNA Glones Enco ding Hnma n PR0771 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 
above, wherem the consensus sequence obtained is herein designated DNA43330. Based on the DNA43330 
sequence, oUgonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the fiill-length coding sequence for PR0771. 

A pair of PGR primers (forward and reverse) were synthesized: 
forward PGR nrimer 5'-GAGGAATATTGAGAAGGGGGAAGGG-3' (SEQ ID NO:443); 
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